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Abstract 

This paper investigates the problem whether the difference between two paramet¬ 
ric models mi, m 2 describing the relation between a response variable and several 
covariates in two different groups is practically irrelevant, such that inference can be 
performed on the basis of the pooled sample. Statistical methodology is developed 
to test the hypotheses Hq : d{mi,m 2 ) > £ versus Hi : d(mi,m 2 ) < £ to demonstrate 
equivalence between the two regression curves mi, m 2 for a pre-specified threshold 
e, where d denotes a distance measuring the distance between mi and m 2 . Our ap¬ 
proach is based on the asymptotic properties of a suitable estimator (i(mi, m 2 ) of this 
distance. In order to improve the approximation of the nominal level for small sample 
sizes a bootstrap test is developed, which addresses the specific form of the interval 
hypotheses. In particular, data has to be generated under the null hypothesis, which 
implicitly dehnes a manifold for the parameter vector. The results are illustrated by 
means of a simulation study and a data example. It is demonstrated that the new 
methods substantially improve currently available approaches with respect to power 
and approximation of the nominal level. 

Keywords and Phrases: dose response studies; nonlinear regression; equivalence of curves 
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1 Introduction 


Testing statistical hypotheses of equivalence has grown signihcantly in importance over the 
last decades, with applications covering such different areas as comparative bioequivalence 
trials, evaluating negligible trend in animal population growth, and model validation; see, 
for example. Cade (2011) and the references therein. Equivalence tests are based on a null 
hypothesis that a parameter of interest, such as the effect difference of two treatments, 
is outside an equivalence region dehned through an appropriate choice of an equivalence 
threshold, denoted as e in this paper. If the null hypothesis is rejected one can then claim at 
a pre-specihed signihcance level a that, in the previous example, the two treatments have 
an equivalent effect [see Wellek (2010)]. Equivalence testing is often used in regulatory 
settings because it reverses the burden of proof compared to a standard test of signihcance. 
In this paper, we consider the problem of establishing equivalence of two regression models 
which are used for the description of the relation between a response variable and several 
covariates for two different groups, respectively. That is, the objective is to investigate 
whether the difference between these two models for the two groups is practically irrele¬ 
vant, so that only one model can be used for both groups based on the pooled sample. 
Such problems appear for example in population pharmacokinetics (PK) where the goal 
is to establish bioequivalence of the concentration prohles over time, say mi, m 2 , of two 
compounds. Traditionally, bioequivalence is established by demonstrating equivalence be¬ 
tween real valued quantities such as the area under the curve (AUC) or the maximum 
concentrations (Cmax) [see Chow and Liu (1992); Hauschke et ah (2007)]. However, such 
an approach may be misleading because the two prohles could be very different although 
they may have similar AUC or Cmax values. Hence it might be more reasonable to work 
directly with the underlying PK prohles instead of the derived summary statistics. 
Another application of comparing two dose response curves occurs when assessing the 
results from one patient population relative to another. For example, the international 
regulatory guidance document ICH E5 (1997) describes the concept of a bridging study 
based on, for example, the request of a new geographic region to determine whether data 
from another region are applicable to its population. If the bridging study shows that 
dose response, safety and efficacy in the new region are similar to another region, then 
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the study is readily interpreted as capable of bridging the foreign data. As a result, the 
ability of extrapolating foreign data to a new region depends upon the similarity between 
the two regions. The ICH E5 guidance does not provide a precise dehnition of similarity 
and various concepts have been used in the literatnre. For example, Tsou et ah (2011) 
proposed a consistency approach for the assessment of similarity between a bridging study 
conducted in a new region and stndies condncted in the original region. On the other hand, 
the ICH E5 gnidance does require that the safety and efficacy prohle in the new region is 
not snbstantially different from that in the original region, and similarity can therefore be 
interpreted as demonstrating “no snbstantial difference”, which results in an equivalence 
testing problem [see Liu et ah (2002)]. 

The problem of establishing eqnivalence of two regression models while controlling the 
Type I error rate has found considerable attention in the recent literatnre. For example, 
Liu et ah (2009) proposed tests for the hypothesis of equivalence of two regression func¬ 
tions, which are applicable in linear models. Gsteiger et al. (2011) considered non-linear 
models and snggested a bootstrap method which is based on a conhdence band for the 
difference of the two regression models [see also Lin et al. (2007a)]. Both references use the 
intersection-union principle [see for example Berger (1982)] to constrnct an overall test for 
equivalence. We demonstrate in this paper that this approach leads to rather conservative 
test procednres with low power. Instead, we propose to directly estimate the distance, say 
d{mi,m 2 ), between the regression curves mi and m 2 and to decide for the eqnivalence of 
the two cnrves if the estimator is smaller than a given threshold. The critical values of 
this test can be obtained by asymptotic theory, which describes the limit distribution of an 
appropriately standardized estimated distance. In order to improve the approximation of 
the nominal level for small samples sizes a non-standard bootstrap approach is proposed. 
In Section 2 we introdnce the general problem of demonstrating the eqnivalence between 
two regression curves. While the concept of similarity of the two prohles is formulated for 
a general distance d, we concentrate in the subsequent discussion on two specihc cases. 
Section 3 is devoted to the comparison of curves with respect to L^-distances. We prove 
asymptotic normality of the corresponding test statistic and construct an asymptotic level- 
a test. Moreover, a non-standard bootstrap procednre is introdnced, which addresses the 
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particular difficulties arising in the problem of testing (parametric) interval hypotheses. In 
particular, resampling has to be performed under the null hypothesis Hq : d{mi,m 2 ) > 
e, which dehnes (implicitly) a manifold in the parameter space. We prove consistency 
of the bootstrap test and demonstrate by means of a simulation study that it yields an 
improvement of the approximation of the nominal level for small sample sizes. In Section 
4 the maximal deviation between the two curves is considered as a measure of similarity, 
for which corresponding results are substantially harder to derive. For example, we prove 
weak convergence of a corresponding test statistic, but the limit distribution depends in a 
complicated way on the extremal points of the difference between the “true” curves. This 
problem is again solved by developing a bootstrap test. The hnite sample properties of the 
new methodology are illustrated in Section 5, where we also provide a comparison with the 
method of Gsteiger et al. (2011). In particular, it is demonstrated that the methodology 
proposed in this paper is more powerful than the test proposed by these authors. The 
methods are illustrated with an example in Section 6. Technical details and proofs are 
deferred to an appendix in Section 7. 

2 Equivalence of regression curves 

We consider two, possibly different, regression models mi, m 2 to describe the relationship 
between a response variable Y and several covariates for two different groups £ = 1,2: 

ye,i,j = me{xe,i, fde) + , 3 = I,..., n^^i, i = l,...,ke. (2.1) 

Here, the covariate region is denoted by df C M'^, xe^i denotes the ith. dose level (in group 
£), the number of patients treated at dose level x^^i and the number of different dose 
levels in group Further, = Yl^Li denotes the sample size in group i, n = ni+n 2 the 
total sample size, and the functions mi and m 2 in (2.1) dehne the (non-linear) regression 
models with pi- and p 2 -dimensional parameters /3i and (32, respectively. The error terms 
are assumed to be independent and identically distributed with mean 0 and variance aj 
for group £ = 1,2. Let (M,d) denote a metric space of real valued functions of the form 
: df —)■ M with distance d. We assume for all /3i, P 2 that the regression functions satisfy 
mi{-,/3i), m 2 {-, ( 32 ) € M, identify the models m^ by their parameters /3£ and denote the 


4 



distance between the two models by d(/3i,/32)(= d{mi,m 2 )). 

We consider the curves mi and m 2 as equivalent if the distance between the two curves is 
small, that is d{l3i, (32) < e, where e is a pre-specihed positive constant. In clinical practice, 
e is often denoted as relevance threshold in the sense that if d{/3i, ( 32 ) < e the difference 
between the two curved is believed not to be clinically relevant. In order to establish 
equivalence of the two dose response curves, we formulate the hypotheses 

Ho : d{/3i,/32) > e versus Hi : d{/3i, (32) < £, (2.2) 

which in the literature are called precise hypotheses, following Berger and Delampady 
(1987). The choice of e depends on the particular problem under consideration. For 
example, when testing for bioequivalence we can conclude that two treatments are not 
different from one another if the 90% conhdence interval of the ratio of a log-transformed 
exposure measure (AUC and/or Cmax; see Section 1) falls completely within the range 
80-125%, indicating that differences in systemic drug exposure within these limits are not 
clinically signihcant [see U.S. Food and Drug Administration (2003)]. For the comparison 
of dissolution profiles, which is a special case of the problem considered in this paper, 
we refer to Appendix I of EMA (2014) with some recommendations for the choice of the 
equivalence threshold on the basis of univariate measures [see for example Yuksel et ah 
( 2000 )]. 

In the following we are particularly interested in the metric space of all continuous functions 
with distances 


doo{/3i,/32) = max\mi{x,/3i) - m2{x,/32)\ (2.3) 

d2{(3i,(32) = [ \mi{x,(3i) - m2{x,(32)\‘^dx. (2.4) 

J X 

The maximal deviation distance doo is of interest, for example, in drug stability studies, 
where one investigates whether the maximum difference in mean drug content between 
two batches is no larger than a pre-specihed threshold; see, for example, Ruberg and Hsu 
(1992) and Liu et ah (2007b). The L^-distance d 2 might be attractive for demonstrating 
similarity of, for example, two PK models because it measures the squared integral of the 
difference between the two curves and is therefore related to the areas under the curves. 
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which in turn is often of interest in bioequivalence studies, as mentioned above. 

The maximal deviation distance (2.3) has also been considered in Liu et al. (2009) and 
Gsteiger et al. (2011), who constructed conhdence bands for the difference of two regression 
curves and used the intersection-union principle to derive an overall test for the hypothesis 
that the two curves are equivalent. In linear models with normally distributed errors this 
test keeps the signihcance level not only asymptotically, but exactly at level a for any hxed 
sample size [see also, Bhargava and Spurrier (2004) or Liu et al. (2008) for some exact 
conhdence bounds when comparing two linear regression models]. However, the resulting 
test turns out to be conservative and has low power, as demonstrated in Section 5. This 
observation can be explained by the fact that the “classical” inversion of a conhdence in¬ 
terval for a parameter, say fx, provides a level a-test for the hypothesis Hq : fj, = 0, but 
it yields usually a conservative test for the hypothesis Hq : \fj,\ > e [see Wellek (2010)]. 
The same phenomenon also appears in the present context of comparing curves. These 
properties may limit the use of the procedures proposed by Liu et al. (2009) and Gsteiger 
et al. (2011) in practice, as we would like to maximize the probability of rejecting the null 
hypothesis if the two regression curves are in fact equivalent as measured by the relevance 
threshold e. 

In the following, we develop alternatives approaches that are more powerful. Roughly 
speaking, we consider for i = 1,2 the estimator of the regression curve and 

reject the null hypothesis (2.2) for small values of the statistic d = d{Pi,/32)- The critical 
values can be obtained by asymptotic theory deriving the limit distribution of ^/n {d — d) 
if ni,n 2 —)■ oo, as developed in the following sections. This approach leads to a satisfac¬ 
tory solution for the L^-distance (2.4) based on the quantiles of the normal distribution 
(see Section 3). However, for the maximal deviation distance (2.3), the limit distribution 
depends in a complicated way on the extremal points 

S = {x e X \ \A{x,/3i,/32 )\ = d^{/3i,/32)} 

of the true difference 


/5i,/32) ^ mi(x,/3i) - m 2 (x, ^ 2 )- 


(2.5) 


Moreover, in small sample trials the approximation of the nominal level of a given test based 
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on asymptotic theory may not be valid. In order to obtain a more accurate approximation of 
the nominal level, we propose a non-standard bootstrap procedure and prove its consistency. 
This procedure has to be constructed in a way such that it addresses the particular features 
of the equivalence hypotheses (2.2). In particular, data have to be generated under the null 
hypothesis d{/ 3 i,/ 32 ) > s, which implicitly dehnes a manifold for the vector of parameters 
G MP 1 +P 2 Qf both models. The non-differentiability of the maximal deviation 
distance d^o exhibits some technical difficulties of such an approach, and for this reason we 
begin the discussion with the L^-distance ^ 2 - 


3 Comparing curves by L^-distances 


In this section we construct a test for the equivalence of the two regression curves with 
respect to the squared distance, i.e. we consider hypotheses of the form 

Hq : f {mi{x, fdi) — m 2 {x,/ 32 )Ydx > 62 versus Hi: f (mi{x,/3i) — m 2 {x,/ 32 )Ydx < 62 - 


' X 


X 

(3.1) 


Note that under certain regularity assumptions (see the Appendix for details) the ordinary 
least squares (OLS) estimators, say (3i and /52, of the parameters (3i and (32 can usually be 
linearized in the form 


0e - (3i) 


1 


EE + Op(l)5 

i=l j=l 


£= 1 , 2 , 


(3.2) 


where the functions are given by 







£ = 1,2, 


(3.3) 


and the {p£ x p^)—dimensional matrices are dehned by 

’ ^=1,2. (3.4) 

^ i=i 

For these arguments we assume that the matrices are non-singular and that the sample 
sizes n£ converge to inhnity such that 


lim — = Cq* > 0 , i = 1,..., , £ = 1, 2, (3.5) 

rii^oo m 


7 



and 

Tl 

lim — = Ag(1,oo). (3.6) 

ni,n2—^oo m 

It then follows by straightforward calculation that the OLS estimators are asymptotically 
normal distributed, i.e. 

v^(/ 3£ - 4 Ar(0, S 7 I) ,£=1,2, (3.7) 

where the symbol —?■ means weak convergence (convergence in distribution for real valued 
random variables). The asymptotic variance in (3.7) can easily be estimated by replacing 
the parameters oi and in (3.4) by their estimators Pi, di and (£ = 1, 2). The 

resulting estimator will be denoted by throughout this paper. The null hypothesis in 
(3.1) is then rejected whenever 

d 2 ■■= d 2 {Pi, P 2 ) = {mi{x, Pi) - m 2 {x,/32))^dx < c, (3.8) 

Jx 

where c denotes a pre-specihed constant dehned through the level of the test. In order to 
determine this constant we will derive the asymptotic distribution of the statistic d 2 - The 
following result is proved in the Appendix. 

Theorem 3.1. If Assumptions 7.1 - 7.5 from the Appendix, (3.5) and (3.6) are satisfied, 
we have 

Vn{d2 - 4) A/'(0, (3.9) 

where the asymptotic variance is given by 

= ^d 2 (/^n/^ 2 ) = 4 / A{x, Pi, P 2 )A{y, Pi, P 2 )k{x,y)dxdy, (3.10) 

JxxX 

A{x, Pi, P 2 ) is defined in (2.5) and the kernel k{x,y) is given by 

k{x,y) := X{^^mi{x,bi)l^^J^Ef^^^mi{y,bi)l^^^^ 

+ 62 ) 1 , 62 ) 1 ,^=^^. (3.11) 

Theorem 3.1 provides a simple asymptotic level-a test for the hypothesis (3.1) of equiva¬ 
lence of two regression curves. More precisely, if = o'‘^^{Pi, P 2 ) denotes the (canonical) 

8 



estimator of the asymptotic variance in (3.10), then the null hypothesis in (3.1) is rejected 
if 


d 2 < €2 -\ - -^Ua, (3.12) 

yn 

where Ua denotes the a-quantile of the standard normal distribution. Note that by the 
nature of the problem the quantile of this test depends on the threshold 62 - The hnite 
sample properties of this test will be investigated in Section 5.1. 


Remark 3.2. It follows from Theorem 3.1 that the test (3.12) has asymptotic level a 
and is consistent if ni,n 2 —)■ cxd. More precisely, if denotes the cumulative distribution 
function of the standard normal distribution, we have for the probability of rejecting the 
null hypothesis in (3.1) 

lPfd2 < £2 + = P(^ ~Z~{d2 ~ *^ 2 ) < -Z~{£2 ~ <^ 2 ) + Ua\ ■ 

V y/n J \ ad 2 ( 7 d 2 ^ 


Under continuity assumptions it follows that —)■ Theorem 3.1 yields y/n (^2 


d 2 )/dd 2 A/"(0,1). This gives 


P( d2 ^ S2 -\ -) ni,n2^oo ^ 

'Tl 


0 if (^2 > ^2 

a \i d 2 = €2 

1 if d 2 < 62 


The test (3.12) can be recommended if the sample sizes are reasonable large. However, we 
will demonstrate in Section 5 that for very small sample sizes, the critical values provided 
by this asymptotic theory may not provide an accurate approximation of the nominal level, 
and for this reason we will also investigate a parametric bootstrap procedure to generate 
critical values for the statistic d 2 - 


Algorithm 3.3. (parametric bootstrap for testing precise hypotheses) 

(1) Calculate the OLS-estimators /3i and 132 , the corresponding variance estimators 


^ k£ 

o-f = — V y](^qij - (3t)f-, ^=1,2, 

1=1 j=i 

and the test statistic d 2 = d 2 {j 3 i,( 32 ) defined by (3.8). 
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(2) Define estimators of the parameters /3i and (32 by 


13 i = 


j3i if d 2 > €2 
if d 2 < 62 


= 1 , 2 , 


(3.13) 


where /3i, (32 denote the OLS-estimators of the parameters I3i, (32 under the constraint 


d2{l3i,l32)= / {mi{x,(3i)-m2{x,(32)Ydx = 62 - 
Jx 


(3.14) 


Finally, dehne ^2 = <^ 2 (/ 5 i,/ 92 ) and note that d 2 > 62 - 
(3) Bootstrap test 


(i) Generate bootstrap data under the null hypothesis, that is 


rij = rat{xt,i, Pi) + rilij , * = 1,..., n»,i, <=1,2, 


(3.15) 


where the errors rj^ - j are independent normally distributed such that ? 7 | ~ 

A/'( 0 ,d^ 2 )_ 

(ii) Calculate the OLS estimators (3^ and and the test statistic 

d*2 = d20*i,Pl) = [ {mi{x,^l) - m2{x,P2)Ydx 
Jx 

from the bootstrap data. Denote by ^^,2 the a—quantile of the distribution of the 
statistic ^ 2 , which depends on the data {Yi^ij\l = 1 , 2; j = 1 , ...rii/, * = 1 , •••, h} 
through the estimators /5i and ( 32 - 

The steps (i) and (ii) are repeated B times to generate replicates i, • • •, <^2 b of 
If < • • • < d* 2 ^'^ denotes the corresponding order statistic, the estimator of the 
quantile of the distribution of d^ is dehned by := and the null hypothesis 

is rejected if 

d2 < qS- ( 3 - 16 ) 

/ D'\ 

Note that the bootstrap quantile q(^ 2 depends on the threshold 62 which is used in 
the hypothesis (3.1), but we do not reflect this dependence in our notation. 
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The following result shows that the bootstrap test (3.16) has asymptotic level a and is 
consistent if ni,n 2 —)■ oo. Its proof can be found in the Appendix. 


Theorem 3.4. Assume that the conditions of Theorem 3.1 are satisfied. 
(1) If the null hypothesis in (3.1) holds, then we have for any a G (0,0.5) 


lim F{d 2 < qa, 2 ) = 

ni ,n2^oo 


0 if d 2 > 82 

CH if 6^2 “ 82 


(3.17) 


(2) If the alternative in (3.1) holds, then we have for any a G (0, 0.5) 


lim ¥{3,2 < qa, 2 ) = 1 - 

ni , 712^00 


(3.18) 


4 Comparing curves by their maximal deviation 

In this section we construct a test for the equivalence of the two regression curves with 
respect to the maximal absolute deviation (2.3). The corresponding test statistic is given 
by the maximal deviation distance 

dcx^ = doo0i,fi2) = max - m 2 {x,fi 2 )\ (4.1) 

x&X 

between the two estimated regression functions, where fii, II 2 are the OLS-estimators from 
the two samples. In order to describe the asymptotic distribution of the statistic doo we 
dehne the set of extremal points 

S = {x e X\ \mi{x,/3i) - m 2 {x,/ 32 )\ = d^} (4.2) 

and introduce the decomposition S = U S~, where 

= [x e X\ mi{x,/3i) - m2{x,/32) = T doo}- (4.3) 

The following result is proved in the Appendix. 

Theorem 4.1. If doo >0 and the assumptions of Theorem 3.1 are satisfied, then 

y/n {doo ~ doo) Z ;= max | maxG(a;), max(—G(a;))}, (4.4) 

^ xe£’+ x&€~ 
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where {G{x)}x^x denotes a Gaussian process defined by 


GW = (4.5) 

and Zi and Z 2 are independent pi- and p 2 -dimensional standard normal distributed random 
variables, respectively, i.e. Zi ~ A/'(0,/pJ, £ = 1,2. 

In principle, Theorem 4.1 provides an asymptotic level a-test for the hypotheses 

-f^o : doo{/3i,/32) > £00 versus Hi : dofi/3i,/32) < £00 (4.6) 

by rejecting the null hypotheses whenever d^o < (la,ooi where denotes the a-quantile of 
the distribution of the random variable Z dehned in (4.4). However, this distribution has 
a very complicated structure. For example, if T = {xq} the distribution of Z is a centered 
normal distribution but with variance 

which depends on the location of the (unique) extremal point xq. In general (more precisely 
in the case > 1) the distribution of Z is the distribution of a maximum of dependent 
Gaussian random variables, where the variances and the dependence structure depend on 
the location of the extremal points of the function A{-, fii, ^ 2 )- Because the estimation of 
these points is very difficult, we propose a bootstrap approach to obtain suitable quantiles. 
The bootstrap test is dehned in the same way as described in Algorithm 3.3, where the 
distance ^2 is replaced by the maximal deviation d^o- The corresponding quantile obtained 
in Step 3(ii) of Algorithm 3.3 is now denoted by ga,oo, while the theoretical quantile of the 
bootstrap distribution is denoted by qa,oo- The following result is proved in the Appendix 
and shows that the test, which rejects the null hypothesis in (4.6) whenever 

doo < qi%: (4.8) 

has asymptotic level a and is consistent. Interestingly the quality of the approximation of 
the nominal level of the test depends on the cardinality of the set S. 

Theorem 4.2. Suppose that the assumptions of Theorem 4-1 hold. 
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(1) If the null hypothesis in (4.6) is satisfied and the set S defined in (4.2) consists of one 
point, then we have for any a G (0, 0.5) 


lim F(doo < qa,oo) 

ni,712^00 


0 ^oo ^ ^oo 

^ ^oo '^oo* 


(4.9) 


(2) Let Fz denote the distribution function of the random variable Z defined in (4.4) and 
qz,a its a-quantile. Assume that Fz is continuous at qz,a and qz,a <0. If the null 
hypothesis in (4.6) is satisfied we have 

limsupP((ioo < ga,oo) < tt. (4.10) 

ni,n2^cxD 

(3) If the alternative in (4.6) is satisfied, then we have for any a G (0, 0.5) 


lim P((ioo < qa,oo) = 1- (4-11) 

ni,n2^c>o 


Remark 4.3. 

(a) The condition in part (2) of Theorem 4.2 is a non-trivial assumption. By results in 
Tsirel’son (1976), the distribution of Z has at most one jump at the left boundary of its 
support and is continuous to the right of that. The condition on Fz to be continuous at 
qz^a is thus equivalent to requiring that the mass at the left endpoint of the support of 
Fz is smaller than a. In some cases it is possible to show that Fz is continuous on M, 
i.e. the mass at its left support point is zero. For example, this follows from Theorem 3 of 
Chernozhukov et ah (2015) provided that the condition 

> 0 (4.12) 

holds. 

The assumption (4.12) is always fulhlled if one of the models contains an additive (placebo) 
effect because in this case the hrst entry of the gradient equals 1. Fur¬ 
thermore, if the two models are of the form ■ m^{x,l3g) for / = 1,2 with 

fie = {fie,1, fie) (for example Michalis-Menten models), we have 
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Consequently, if (4.12) was not fulfilled, there would exist Xq E S such that m^{xo, (3^) = 
0, i = I, 2, and as xq E S it holds 

doo = \mi{xo,/3i) - m2{xo,/32)\ = |/3i,i ■ m%xo,/3^) - (32,i ■ nil{xo,(32)\ = 0 

This yields mi = m 2 and does not correspond to the null hypothesis. 

(c) Note that the asymptotic Type I error rate of the bootstrap test is precisely a at 
the boundary of the hypothesis (i.e. doo = ^oo) if the cardinality of E is one. On the 
other hand, if the set E contains more than one point, part (2) of Theorem 4.2 indicates 
that the corresponding bootstrap test is usually conservative, even at the boundary of the 
hypothesis. These results are confirmed by a simulation study in Section 5.2. 


5 Finite sample properties 


In this section we investigate the hnite sample properties of the asymptotic and bootstrap 
tests proposed in Sections 3 and 4 in terms of power and size. For the distance doo we also 
provide a comparison with the approach from Gsteiger et ah (2011). Their method follows 
from (3.5) - (3.7) and an application of the Delta method [see for example Van der Vaart 
(1998)] so that the prediction mi{x, /3i) — m 2 {x, (32) for the difference of the two regression 
models at the point x is approximately normally distributed. That is, 
mi(a;,/ 3 i) - m 2 {x, ^2) - imi{x,( 3 i) - 7712(0;,/^a)) 


{x, (31,(32) 


A7(0,l), 


where 




7=1 


(5.1) 


and S£ denotes the estimator of the variance in (3.4), which is obtained by replacing the 
parameters (3^ and Q,i by their estimators (3^, and nt^i/rii {i = 1,2). Gsteiger 
et al. (2011) proposed a test based on the pointwise conhdence bands derived by Liu et ah 
(2007a), that is 

mi{x,^i) - 7772(0;,/^a) ± zi —oTni ,n2 (0;,/ 5 i,/ 32 ), 

where zi-a denotes the (1 — Q;)-quantile of the standard normal distribution. A test for 
the hypotheses (4.6) is finally obtained by rejecting the null hypothesis and conclude for 


14 



equivalence, if the maximum (minimum) of the upper (lower) conhdence band is smaller 
(larger) than £00 (—^oo)- A particular advantage of this test is that it directly refers to the 
distance (2.3), which has a nice interpretation in many applications. Moreover, in linear 
models (with normally distributed errors) it is an exact level-a test. However, the resulting 
test is conservative and has low power compared to the methods proposed in this paper as 
shown in Section 5.2. 

All results in this and the following section are based on 1000 simulation runs and the 
quantiles of the bootstrap tests have been obtained hy B = 300 bootstrap replications. In 
all examples the dose range is given by the interval X = [0,4] and an equal number of 
patients is allocated at the hve dose levels = 0,a;£,2 = 1, = 2, = 3, = 4 in 

both groups (that is ki = k 2 = X). 

5.1 Tests based on the distance d 2 

For the sake of brevity we restrict ourselves to a comparison of two shifted EMAX-models 


mi{x,l3i) = I3n + and m 2 {x, [32) = P 21 + , (5.2) 

where /3i = {dn, du, dis) = (<^, 5,1) and ^2 = (/d 2 i,/d 22 ,/923) = (0,5,1). In Tables 1 and 2 
we display the simulated Type I error rates of the bootstrap test (3.16) and the asymptotic 
test (3.12) for £2 = 1 in (3.1) and various conhgurations of ni, n 2 and 5. In the 

interior of the null hypothesis (i.e. d 2 > £ 2 ) the Type I error rates of the tests (3.12) and 
(3.16) are smaller than the nominal level as predicted by Remark 3.2. For both tests we 
observe a rather precise approximation of the nominal level (even for small sample sizes) 
at the boundary of the null hypothesis (i.e. d 2 = 1). In some cases the approximation of 
the nominal level by the bootstrap test (3.16) is slightly more accurate and for this reason 
we recommend to use the bootstrap test (3.16) to establish equivalence of two regression 
models with respect to the L^-distance. 

In Tables 3 and 4 we display the power of the two tests under various alternatives specihed 
by the value = <5 in model (5.2). We observe a reasonable power of both tests in all cases 
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a - 0.05 

q : = 0.1 




(A, A) 

(A, A) 

(" 1 , " 2 ) 

S 

d 2 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(10,10) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(10,10) 

0.75 

2.25 

0.004 

0.002 

0.001 

0.000 

0.002 

0.000 

(10,10) 

0.5 

1 

0.051 

0.064 

0.052 

0.101 

0.120 

0.118 

(10,20) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(10,20) 

0.75 

2.25 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(10,20) 

0.5 

1 

0.055 

0.060 

0.051 

0.104 

0.111 

0.101 

(20,20) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(20,20) 

0.75 

2.25 

0.001 

0.002 

0.000 

0.004 

0.005 

0.001 

(20,20) 

0.5 

1 

0.057 

0.058 

0.050 

0.125 

0.107 

0.097 

(50,50) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(50,50) 

0.75 

2.25 

0.001 

0.000 

0.000 

0.002 

0.000 

0.000 

(50,50) 

0.5 

1 

0.057 

0.048 

0.054 

0.097 

0.114 

0.093 


Table 1: Simulated Type I error rates of the bootstrap test (3.16) for the equivalenee of two 
shifted EMAX models defined in (5.2). The threshold in (3.1) is ehosen as €2 = 1. 





a - 0.05 

q : = 0.1 




(A, A) 

(A, A) 

(" 1 , " 2 ) 

S 

d 2 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(10,10) 

1 

4 

0.002 

0.002 

0.002 

0.000 

0.002 

0.003 

(10,10) 

0.75 

2.25 

0.005 

0.005 

0.009 

0.007 

0.011 

0.016 

(10,10) 

0.5 

1 

0.080 

0.042 

0.049 

0.102 

0.061 

0.071 

(10,20) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(10,20) 

0.75 

2.25 

0.007 

0.012 

0.007 

0.017 

0.015 

0.012 

(10,20) 

0.5 

1 

0.055 

0.063 

0.060 

0.081 

0.078 

0.084 

(20,20) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(20,20) 

0.75 

2.25 

0.000 

0.001 

0.002 

0.017 

0.003 

0.006 

(20,20) 

0.5 

1 

0.060 

0.066 

0.080 

0.090 

0.091 

0.096 

(50,50) 

1 

4 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

(50,50) 

0.75 

2.25 

0.000 

0.000 

0.000 

0.000 

0.000 

0.001 

(50,50) 

0.5 

1 

0.041 

0.058 

0.052 

0.071 

0.087 

0.073 


Table 2: Simulated Type I error rates of the asymptotie test (3.12) for the equivalenee of 
two shifted EMAX models defined in (5.2). The threshold in (3.1) is ehosen as €2 = 1 


under consideration. In those cases where the asymptotic test (3.12) keeps (or exceeds) 
its nominal level it is slightly more powerful than the bootstrap test (3.16). The opposite 
performance can be observed in those cases where the asymptotic test is conservative (e.g., 
if q ; = 10%, ni = n 2 = 10). We also note that the power of both tests is a decreasing 
function of the distance ^ 2 , as predicted by the asymptotic theory. 
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a - 0.05 

q : = 0.1 




(A, A) 

(alA) 

(" 1 , " 2 ) 

5 

d2 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(10,10) 

0.25 

0.25 

0.210 

0.118 

0.134 

0.300 

0.212 

0.256 

(10,10) 

0.1 

0.04 

0.294 

0.132 

0.186 

0.427 

0.250 

0.312 

(10,10) 

0 

0 

0.351 

0.145 

0.176 

0.467 

0.286 

0.340 

(10,20) 

0.25 

0.25 

0.257 

0.125 

0.191 

0.392 

0.234 

0.305 

(10,20) 

0.1 

0.04 

0.395 

0.164 

0.254 

0.535 

0.305 

0.395 

(10,20) 

0 

0 

0.437 

0.158 

0.291 

0.598 

0.290 

0.474 

(20,20) 

0.25 

0.25 

0.392 

0.171 

0.225 

0.534 

0.302 

0.382 

(20,20) 

0.1 

0.04 

0.560 

0.308 

0.418 

0.720 

0.460 

0.562 

(20,20) 

0 

0 

0.610 

0.314 

0.390 

0.757 

0.462 

0.555 

(50,50) 

0.25 

0.25 

0.724 

0.460 

0.554 

0.825 

0.595 

0.825 

(50,50) 

0.1 

0.04 

0.961 

0.691 

0.821 

0.982 

0.824 

0.973 

(50,50) 

0 

0 

0.984 

0.734 

0.865 

0.998 

0.861 

0.999 


Table 3: Simulated power of the bootstrap test (3.16) for the equivalenee of two shifted 
EMAX models defined in (5.2). The threshold in (3.1) is ehosen as €2 = 1. 





a - 0.05 

q : = 0.1 




(A, 4) 

(A, A) 

(" 1 , " 2 ) 

S 

d 2 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(0.25,0.25) 

(0.5, 0.5) 

(0.25,0.5) 

(10,10) 

0.25 

0.25 

0.264 

0.103 

0.175 

0.311 

0.131 

0.217 

(10,10) 

0.1 

0.04 

0.351 

0.139 

0.196 

0.431 

0.183 

0.247 

(10,10) 

0 

0 

0.381 

0.120 

0.222 

0.468 

0.168 

0.279 

(10,20) 

0.25 

0.25 

0.305 

0.147 

0.256 

0.382 

0.192 

0.317 

(10,20) 

0.1 

0.04 

0.468 

0.218 

0.359 

0.536 

0.268 

0.438 

(10,20) 

0 

0 

0.510 

0.220 

0.358 

0.570 

0.272 

0.455 

(20,20) 

0.25 

0.25 

0.423 

0.271 

0.321 

0.493 

0.328 

0.341 

(20,20) 

0.1 

0.04 

0.640 

0.328 

0.501 

0.716 

0.407 

0.585 

(20,20) 

0 

0 

0.690 

0.351 

0.501 

0.781 

0.438 

0.573 

(50,50) 

0.25 

0.25 

0.659 

0.475 

0.534 

0.740 

0.562 

0.649 

(50,50) 

0.1 

0.04 

0.965 

0.750 

0.868 

0.974 

0.813 

0.911 

(50,50) 

0 

0 

0.980 

0.848 

0.937 

0.991 

0.893 

0.946 


Table 4: Simulated power of the asymptotie test (3.12) for the equivalenee of two shifted 
EMAX models defined in (5.2). The threshold in (3.1) is ehosen as €2 = 1- 


5.2 Tests based on the distance doo 

We now investigate the maximum deviation distance and also provide a comparison with 
the test proposed by Gsteiger et al. (2011). Motivated by the discussion in Section 4 we 
distinguish the cases where the cardinality of the set £ is one or larger than one. The results 
will show that with an increasing size of the set £ the test is getting more conservative. 
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Example 5.1. {jj^£ = 1) We begin with a comparison of an EMAX with an exponential 
model, that is 


mi(a;,/3i) = /3ii + 




and m2(a;,/32) =/32 i +/9 


As + X 

where (3i = (Ai,/3i2, As) = (1,2,1) and A = (Ai,A2,A3) = 


22-(exp (^)- 

(h, 2.2, 8) [see 


- 1). (5.3) 

Figure 1]. In 



T-1- \ -1-T 

0 12 3 4 

Dose 


Figure 1: The models defined in (5.3) for two different choices of 6. 


Table 5 we display the simulated rejection probabilities of the bootstrap test (4.8) under 
the null hypothesis in (4.6), where Soo = 1- 

If the cardinality of the set £ is one, the distribution of the test statistic doo is a centered 
normal distribution with variance dehned in (4.7). Thus, if the unique extremal point has 
been estimated, we obtain an estimate, say of the asymptotic variance of the statistic 
doo- The null hypothesis is now rejected (at asymptotic level a), whenever 




doo ^ '^oo 4“ /— 


n 


(5.4) 


where Ua is the a—quantile of the standard normal distribution. The results for this test 
are given in Table 6. We observe that the bootstrap test (4.8) keeps its nominal level 
at the boundary of the null hypothesis, whereas the level is smaller in the interior (this 
conhrms the theoretical results from Section 4). The approximation is less precise for small 
sample sizes. Compared to the bootstrap test based on the distance d 2 the test (4.8) is 
conservative. The asymptotic test (5.4) is very conservative, even for relative large sample 
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sizes (see Table 6). A possible explanation for this observation consists in the fact that 
the estimation of the extremal point is a difficult problem. In Table 5 we also display the 
rejection probabilities of the test of Gsteiger et ah (2011) in brackets. This test is very 
conservative as its level is practically 0 for almost all cases under consideration. 

The simulated power of the bootstrap and the asymptotic doo-test is displayed in Table 
7 and 8. We observe a substantially better performance of the bootstrap test (4.8) in all 
cases of consideration. In Table 7 we also display the rejection probabilities of the test of 
Gsteiger et ah (2011) in brackets and we conclude that the methods proposed in this paper 
yield a substantial improvement for small sample sizes or large variances. 





a - 0.05 

OL = O.l 




(<^7 A) 

(A, A) 


(5 

doo 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

0.25 

1.5 

0.001 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 10 , 10 ) 

0.5 

1.25 

0.005 ( 0 . 000 ) 

0.011 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.013 ( 0 . 000 ) 

0.030 ( 0 . 000 ) 

0.020 ( 0 . 000 ) 

( 10 , 10 ) 

0.75 

1 

0.045 ( 0 . 007 ) 

0.037 ( 0 . 000 ) 

0.036 ( 0 . 001 ) 

0.102 ( 0 . 021 ) 

0.086 ( 0 . 002 ) 

0.090 ( 0 . 007 ) 

( 10 , 20 ) 

0.25 

1.5 

0.000 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 10 , 20 ) 

0.5 

1.25 

0.004 ( 0 . 000 ) 

0.013 ( 0 . 000 ) 

0.005 ( 0 . 000 ) 

0.015 ( 0 . 000 ) 

0.025 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

( 10 , 20 ) 

0.75 

1 

0.045 ( 0 . 017 ) 

0.046 ( 0 . 002 ) 

0.028 ( 0 . 004 ) 

0.099 ( 0 . 042 ) 

0.104 ( 0 . 011 ) 

0.079 ( 0 . 017 ) 

( 20 , 20 ) 

0.25 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 20 , 20 ) 

0.5 

1.25 

0.001 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.018 ( 0 . 000 ) 

0.011 ( 0 . 000 ) 

( 20 , 20 ) 

0.75 

1 

0.034 ( 0 . 013 ) 

0.038 ( 0 . 003 ) 

0.048 ( 0 . 007 ) 

0.091 ( 0 . 033 ) 

0.100 ( 0 . 020 ) 

0.104 ( 0 . 026 ) 

( 50 , 50 ) 

0.25 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 50 , 50 ) 

0.5 

1.25 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 50 , 50 ) 

0.75 

1 

0.051 ( 0 . 013 ) 

0.059 ( 0 . 009 ) 

0.058 ( 0 . 010 ) 

0.096 ( 0 . 040 ) 

0.108 ( 0 . 043 ) 

0.105 ( 0 . 096 ) 


Table 5: Simulated Type I error rates of the bootstrap test (4.8) for the equivalenee of 
an EMAX and an exponential model defined by (5.3). The threshold in (4.6) is ehosen as 
£oo = 1 ■ The numbers in braekets show the simulated Type I error rates of the test proposed 
in Gsteiger et al. (2011). 


In the remaining part of this section we consider three further scenarios, where the true 
maximum absolute distance of the models mi and m 2 is attained at more than one point. 
In this case, an asymptotic test based on the maximum deviation is not available and 
therefore only the bootstrap test can be used. Our results demonstrate that the test is 
conservative compared to the scenario before, where ff£ = 1. This conhrms our theoretical 
hndings in Theorem 4.2. Moreover, the test gets more conservative if the size of the set £ 
increases. We also display (the numbers in brackets) the corresponding values for the test 
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a - 0.05 

q : = 0.1 




(A, A) 

Pi A) 

(" 1 , " 2 ) 

S 

doo 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

0.25 

1.5 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

( 10 , 10 ) 

0.5 

1.25 

0.001 

0.001 

0.000 

0.003 

0.004 

0.001 

( 10 , 10 ) 

0.75 

1 

0.012 

0.005 

0.003 

0.029 

0.010 

0.001 

( 10 , 20 ) 

0.25 

1.5 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

( 10 , 20 ) 

0.5 

1.25 

0.000 

0.005 

0.001 

0.000 

0.007 

0.003 

( 10 , 20 ) 

0.75 

1 

0.019 

0.006 

0.009 

0.038 

0.014 

0.023 

( 20 , 20 ) 

0.25 

1.5 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

( 20 , 20 ) 

0.5 

1.25 

0.000 

0.000 

0.001 

0.000 

0.001 

0.001 

( 20 , 20 ) 

0.75 

1 

0.011 

0.036 

0.009 

0.033 

0.025 

0.027 

( 50 , 50 ) 

0.25 

1.5 

0.000 

0.000 

0.000 

0.000 

0.000 

0.000 

( 50 , 50 ) 

0.5 

1.25 

0.000 

0.000 

0.000 

0.000 

0.003 

0.000 

( 50 , 50 ) 

0.75 

1 

0.016 

0.015 

0.012 

0.039 

0.039 

0.041 


Table 6: Simulated Type I error rates of the asymptotie test (5.4) for the equivalenee of 
an EMAX and an exponential model defined by (5.3). The threshold in (4.6) is ehosen as 

^OO 1 





a - 0.05 

a = 0.1 




( A . A ) 

( A , A ) 

(" 1 , " 2 ) 

(5 

doo 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

1 

0.75 

0.160 ( 0 . 026 ) 

0.093 ( 0 . 004 ) 

0.125 ( 0 . 007 ) 

0.297 ( 0 . 083 ) 

0.225 ( 0 . 007 ) 

0.229 ( 0 . 033 ) 

( 10 , 10 ) 

1.5 

0.5 

0.237 ( 0 . 037 ) 

0.133 ( 0 . 003 ) 

0.164 ( 0 . 009 ) 

0.383 ( 0 . 117 ) 

0.231 ( 0 . 018 ) 

0.309 ( 0 . 029 ) 

( 10 , 20 ) 

1 

0.75 

0.185 ( 0 . 084 ) 

0.123 ( 0 . 006 ) 

0.159 ( 0 . 025 ) 

0.320 ( 0 . 162 ) 

0.226 ( 0 . 035 ) 

0.283 ( 0 . 089 ) 

( 10 , 20 ) 

1.5 

0.5 

0.300 ( 0 . 087 ) 

0.175 ( 0 . 005 ) 

0.269 ( 0 . 035 ) 

0.457 ( 0 . 190 ) 

0.305 ( 0 . 043 ) 

0.414 ( 0 . 120 ) 

( 20 , 20 ) 

1 

0.75 

0.214 ( 0 . 130 ) 

0.138 ( 0 . 022 ) 

0.171 ( 0 . 054 ) 

0.393 ( 0 . 248 ) 

0.271 ( 0 . 086 ) 

0.345 ( 0 . 137 ) 

( 20 , 20 ) 

1.5 

0.5 

0.401 ( 0 . 190 ) 

0.229 ( 0 . 036 ) 

0.363 ( 0 . 080 ) 

0.604 ( 0 . 356 ) 

0.398 ( 0 . 122 ) 

0.523 ( 0 . 189 ) 

( 50 , 50 ) 

1 

0.75 

0.504 ( 0 . 400 ) 

0.274 ( 0 . 183 ) 

0.363 ( 0 . 297 ) 

0.662 ( 0 . 552 ) 

0.416 ( 0 . 326 ) 

0.532 ( 0 . 433 ) 

( 50 , 50 ) 

1.5 

0.5 

0.777 ( 0 . 667 ) 

0.491 ( 0 . 294 ) 

0.606 ( 0 . 493 ) 

0.877 ( 0 . 791 ) 

0.648 ( 0 . 478 ) 

0.739 ( 0 . 604 ) 


Table 7: Simulated power of the bootstrap test (4.8) for the equivalenee of an EMAX and 
an exponential model defined by (5.3). The threshold in (4.6) is ehosen as Eoo = 1- The 
numbers in braekets show the simulated power of the test proposed in Gsteiger et al. (2011). 


of Gsteiger et al. (2011), which is very conservative and less powerfnl than the bootstrap 
test in all cases nnder consideration. 

Example 5.2. = 2) We consider two EMAX models, given by 

mi{x, (5i) = (5ii + and 7712 ( 0 ;, /32) = (32i + , (5.5) 

P13 + X P23 + X 

where /3i = (/^n,/3i2,/^is) = (<5,6,2) and (32 = (/52i,/522,/523) = (0,5,1). In this case 
the maximnm absolnte difference of d^o = <5 is attained at the bonndary points of the 
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a - 0.05 

a = 0.1 






(" 1 , " 2 ) 

6 

doo 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

1 

0.75 

0.042 

0.006 

0.011 

0.109 

0.017 

0.046 

( 10 , 10 ) 

1.5 

0.5 

0.064 

0.008 

0.014 

0.140 

0.026 

0.047 

( 10 , 20 ) 

1 

0.75 

0.114 

0.014 

0.048 

0.199 

0.047 

0.106 

( 10 , 20 ) 

1.5 

0.5 

0.129 

0.018 

0.059 

0.228 

0.052 

0.127 

( 20 , 20 ) 

1 

0.75 

0.151 

0.036 

0.064 

0.285 

0.093 

0.170 

( 20 , 20 ) 

1.5 

0.5 

0.209 

0.060 

0.104 

0.360 

0.120 

0.202 

( 50 , 50 ) 

1 

0.75 

0.417 

0.206 

0.303 

0.569 

0.337 

0.440 

( 50 , 50 ) 

1.5 

0.5 

0.706 

0.267 

0.408 

0.826 

0.462 

0.630 


Table 8: Simulated power of the asymptotie test (5.4) for the equivalenee of an EM AX and 
an exponential model defined by (5.3). The threshold in (4.6) is chosen as Soo = 

design space, that is T = {0,4}. The corresponding rejection probabilities under the null 
hypothesis are presented in Table 9. We observe that the bootstrap test keeps its level 
in all situations, but it is conservative (see also Theorem 4.2). The test of Gsteiger et ah 
(2011) is even more conservative. We also observe an improvement in power by the new 
test in comparison with the test of Gsteiger et ah (2011), in particular for small samples 
sizes (see Table 10). 




a - 0.05 

a = 0.1 



(A, A) 

(A, A) 

(" 1 , " 2 ) 

doo = 5 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

( 10 , 10 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

( 10 , 10 ) 

1 

0.018 ( 0 . 002 ) 

0.017 ( 0 . 001 ) 

0.008 ( 0 . 000 ) 

0.042 ( 0 . 006 ) 

0.042 ( 0 . 003 ) 

0.034 ( 0 . 000 ) 

( 10 , 20 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 10 , 20 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.003 ( 0 . 000 ) 

( 10 , 20 ) 

1 

0.012 ( 0 . 002 ) 

0.015 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

0.052 ( 0 . 005 ) 

0.049 ( 0 . 003 ) 

0.042 ( 0 . 000 ) 

( 20 , 20 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 20 , 20 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 20 , 20 ) 

1 

0.023 ( 0 . 000 ) 

0.012 ( 0 . 000 ) 

0.015 ( 0 . 003 ) 

0.066 ( 0 . 008 ) 

0.043 ( 0 . 001 ) 

0.042 ( 0 . 004 ) 

( 50 , 50 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 50 , 50 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

( 50 , 50 ) 

1 

0.022 ( 0 . 002 ) 

0.020 ( 0 . 000 ) 

0.016 ( 0 . 002 ) 

0.050 ( 0 . 007 ) 

0.048 ( 0 . 006 ) 

0.049 ( 0 . 007 ) 


Table 9: Simulated Type I error rates 
of two EMAX-models defined by (5.5). 
numbers in brackets show the simulated 
et al. (2011). 


of the bootstrap doo-test (4.8) for the equivalence 
The threshold in (4.6) is chosen as Eoo = 1- The 
Type I error rates of the test proposed by Gsteiger 
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a - 0.05 

a = 0.1 



(A-A) 

(A, A) 

(" 1 , " 2 ) 

doQ = (5 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

0.75 

0.074 ( O . Oll ) 

0.042 ( 0 . 003 ) 

0.059 ( 0 . 002 ) 

0.154 ( 0 . 040 ) 

0.116 ( 0 . 007 ) 

0.133 ( 0 . 021 ) 

( 10 , 10 ) 

0.5 

0.189 ( 0 . 055 ) 

0.111 ( 0 . 001 ) 

0.139 ( 0 . 011 ) 

0.312 ( 0 . 137 ) 

0.202 ( 0 . 014 ) 

0.249 ( 0 . 049 ) 

( 10 , 10 ) 

0 

0.267 ( 0 . 088 ) 

0.116 ( 0 . 003 ) 

0.170 ( 0 . 020 ) 

0.415 ( 0 . 211 ) 

0.237 ( 0 . 026 ) 

0.299 ( 0 . 070 ) 

( 10 , 20 ) 

0.75 

0.067 ( 0 . 015 ) 

0.053 ( 0 . 002 ) 

0.070 ( 0 . 007 ) 

0.161 ( 0 . 045 ) 

0.124 ( 0 . 019 ) 

0.156 ( 0 . 030 ) 

( 10 , 20 ) 

0.5 

0.229 ( 0 . 106 ) 

0.141 ( 0 . 006 ) 

0.185 ( 0 . 057 ) 

0.373 ( 0 . 231 ) 

0.249 ( 0 . 036 ) 

0.320 ( 0 . 138 ) 

( 10 , 20 ) 

0 

0.343 ( 0 . 172 ) 

0.191 ( 0 . 011 ) 

0.234 ( 0 . 062 ) 

0.513 ( 0 . 314 ) 

0.314 ( 0 . 055 ) 

0.376 ( 0 . 168 ) 

( 20 , 20 ) 

0.75 

0.120 ( 0 . 045 ) 

0.050 ( 0 . 005 ) 

0.074 ( 0 . 023 ) 

0.215 ( 0 . 102 ) 

0.149 ( 0 . 028 ) 

0.184 ( 0 . 068 ) 

( 20 , 20 ) 

0.5 

0.372 ( 0 . 239 ) 

0.192 ( 0 . 032 ) 

0.243 ( 0 . 079 ) 

0.531 ( 0 . 380 ) 

0.324 ( 0 . 117 ) 

0.367 ( 0 . 192 ) 

( 20 , 20 ) 

0 

0.462 ( 0 . 334 ) 

0.234 ( 0 . 049 ) 

0.338 ( 0 . 113 ) 

0.591 ( 0 . 511 ) 

0.392 ( 0 . 148 ) 

0.487 ( 0 . 260 ) 

( 50 , 50 ) 

0.75 

0.231 ( 0 . 133 ) 

0.110 ( 0 . 040 ) 

0.154 ( 0 . 069 ) 

0.368 ( 0 . 247 ) 

0.219 ( 0 . 108 ) 

0.296 ( 0 . 170 ) 

( 50 , 50 ) 

0.5 

0.708 ( 0 . 613 ) 

0.387 ( 0 . 294 ) 

0.542 ( 0 . 411 ) 

0.834 ( 0 . 770 ) 

0.554 ( 0 . 469 ) 

0.689 ( 0 . 593 ) 

( 50 , 50 ) 

0 

0.792 ( 0 . 773 ) 

0.528 ( 0 . 468 ) 

0.630 ( 0 . 576 ) 

0.873 ( 0 . 862 ) 

0.669 ( 0 . 627 ) 

0.757 ( 0 . 721 ) 


Table 10: Simulated power of the bootstrap doo-test (4.8) for the equivalence of two EMAX- 
models defined by (5.5). The threshold in (4.6) is chosen as Eoo = 1- The numbers in 
brackets show the simulated power of the test proposed in Gsteiger et al. (2011). 


Example 5.3. {ff£ = 3) In the next scenario we consider a quadratic and a linear model, 
that is 


mi{x,(di) = (diix^ + ( 5 i 2 X + /^is and m 2 {x,P 2 ) = I 32 ix + /322- (5.6) 

where /3i = (Ai,/3i2, As) = {d,—36,36) and A = {/ 621 , ( 622 ) = (<^A)- The maximum 
absolute distance is given by doo = 26, attained at £ = {0,2,4}. The simulated rejection 
probabilities under the null hypothesis are shown in Table 11. A comparison with Table 
10 shows that the level decreases with the size of the set of extremal points £. The results 
displayed in Table 12 show again that the new test has a larger power than the test of 
Gsteiger et al. (2011). 


Example 5.4. (£ = [0,4]) We conclude this section with an investigation of the models 
in (5.2) which represents somehow the extreme case, as the set of extremal points of the 
true absolute difference is given hy £ = [0,4], which is the entire dose range. In Table 13 
we display the rejection probabilities of the bootstrap test (4.8) under the null hypothesis. 
Corresponding results under the alternative are shown in Table 14, where it is demonstrated 
that the bootstrap test (4.8) yields again a substantial improvement in power compared 
to the test of Gsteiger et al. (2011). While this test has practically no power, the new 
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a - 0.05 

a = 0.1 





(" 1 , " 2 ) 

doo = 25 

{ 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

{ 10 , 10 ) 

2 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.010 ( 0 . 000 ) 

0.023 ( 0 . 000 ) 

( 10 , 10 ) 

1.5 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.005 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

0.024 ( 0 . 000 ) 

{ 10 , 10 ) 

1 

0.007 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

0.028 ( 0 . 002 ) 

0.025 ( 0 . 000 ) 

0.038 ( 0 . 001 ) 

{ 10 , 20 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

{ 10 , 20 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

{ 10 , 20 ) 

1 

0.008 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.028 ( 0 . 004 ) 

0.023 ( 0 . 000 ) 

0.022 ( 0 . 004 ) 

{ 20 , 20 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.024 ( 0 . 000 ) 

{ 20 , 20 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.018 ( 0 . 000 ) 

{ 20 , 20 ) 

1 

0.005 ( 0 . 000 ) 

0.009 ( 0 . 000 ) 

0.003 ( 0 . 000 ) 

0.015 ( 0 . 001 ) 

0.030 ( 0 . 000 ) 

0.027 ( 0 . 000 ) 

{ 50 , 50 ) 

2 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.015 ( 0 . 000 ) 

{ 50 , 50 ) 

1.5 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.005 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.007 ( 0 . 000 ) 

{ 50 , 50 ) 

1 

0.001 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.014 ( 0 . 001 ) 

0.019 ( 0 . 000 ) 

0.027 ( 0 . 001 ) 


Table 11: Simulated Type I error rates of the bootstrap doo-test (4.8) for the equivalence 
of a quadratic and a linear model defined by (5.6). The threshold in (4.6) is chosen as 
£oo = 1 ■ The numbers in brackets show the simulated Type I error rates of the test proposed 
by Gsteiger et al. (2011). 




a - 0.05 

a = 0.1 





("- 1 , * 12 ) 

doo = 25 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

0.4 

0.290 ( 0 . 106 ) 

0.143 ( 0 . 011 ) 

0.213 ( 0 . 038 ) 

0.452 ( 0 . 210 ) 

0.256 ( 0 . 010 ) 

0.356 ( 0 . 105 ) 

( 10 , 10 ) 

0.2 

0.438 ( 0 . 198 ) 

0.206 ( 0 . 014 ) 

0.324 ( 0 . 088 ) 

0.619 ( 0 . 371 ) 

0.367 ( 0 . 083 ) 

0.473 ( 0 . 221 ) 

( 10 , 20 ) 

0.4 

0.313 ( 0 . 158 ) 

0.176 ( 0 . 023 ) 

0.284 ( 0 . 100 ) 

0.484 ( 0 . 314 ) 

0.295 ( 0 . 089 ) 

0.446 ( 0 . 202 ) 

( 10 , 20 ) 

0.2 

0.542 ( 0 . 329 ) 

0.262 ( 0 . 049 ) 

0.442 ( 0 . 197 ) 

0.694 ( 0 . 531 ) 

0.423 ( 0 . 140 ) 

0.602 ( 0 . 397 ) 

( 20 , 20 ) 

0.4 

0.500 ( 0 . 356 ) 

0.230 ( 0 . 092 ) 

0.377 ( 0 . 162 ) 

0.641 ( 0 . 572 ) 

0.377 ( 0 . 228 ) 

0.531 ( 0 . 344 ) 

( 20 , 20 ) 

0.2 

0.748 ( 0 . 661 ) 

0.430 ( 0 . 212 ) 

0.620 ( 0 . 415 ) 

0.858 ( 0 . 806 ) 

0.601 ( 0 . 406 ) 

0.764 ( 0 . 627 ) 

( 50 , 50 ) 

0.4 

0.879 ( 0 . 851 ) 

0.573 ( 0 . 448 ) 

0.767 ( 0 . 690 ) 

0.942 ( 0 . 926 ) 

0.733 ( 0 . 634 ) 

0.877 ( 0 . 821 ) 

( 50 , 50 ) 

0.2 

0.991 ( 0 . 986 ) 

0.828 ( 0 . 800 ) 

0.936 ( 0 . 929 ) 

0.998 ( 0 . 995 ) 

0.915 ( 0 . 899 ) 

0.973 ( 0 . 975 ) 


Table 12: Simulated power of the bootstrap doo-test (4.8) for the equivalence of a quadratic 
and a linear model defined by (5.6). The threshold in (4.6) is chosen as Soo = 1- The 
numbers in brackets show the simulated power of the test proposed in Gsteiger et al. (2011). 

bootstrap test proposed in this paper is able to establish equivalence between the curves 
with reasonable Type II error rates, if the total sample size is larger than 50. 


23 






a - 0.05 

q : = 0.1 



(A. A) 

(A, A) 

(" l ,' ra 2 ) 

d = d^o 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

1 

0.000 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

0.007 ( 0 . 000 ) 

0.019 ( 0 . 000 ) 

0.010 ( 0 . 000 ) 

( 10 , 10 ) 

0.75 

0.000 ( 0 . 000 ) 

0.008 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

0.013 ( 0 . 002 ) 

0.041 ( 0 . 000 ) 

0.020 ( 0 . 000 ) 

( 10 , 10 ) 

0.5 

0.015 ( 0 . 001 ) 

0.040 ( 0 . 000 ) 

0.016 ( 0 . 000 ) 

0.050 ( 0 . 005 ) 

0.104 ( 0 . 000 ) 

0.054 ( 0 . 002 ) 

( 10 , 20 ) 

1 

0 . 000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.005 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.003 ( 0 . 000 ) 

( 10 , 20 ) 

0.75 

0.001 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.005 ( 0 . 000 ) 

0.023 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

( 10 , 20 ) 

0.5 

0.018 ( 0 . 000 ) 

0.016 ( 0 . 000 ) 

0.012 ( 0 . 000 ) 

0.045 ( 0 . 000 ) 

0.051 ( 0 . 000 ) 

0.037 ( 0 . 000 ) 

( 20 , 20 ) 

1 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.006 ( 0 . 000 ) 

( 20 , 20 ) 

0.75 

0.000 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.003 ( 0 . 000 ) 

0.010 ( 0 . 002 ) 

0.002 ( 0 . 000 ) 

( 20 , 20 ) 

0.5 

0.006 ( 0 . 001 ) 

0.019 ( 0 . 000 ) 

0.016 ( 0 . 000 ) 

0.027 ( 0 . 001 ) 

0.051 ( 0 . 000 ) 

0.046 ( 0 . 000 ) 

( 50 , 50 ) 

1 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.001 ( 0 . 000 ) 

( 50 , 50 ) 

0.75 

0.006 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.000 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.007 ( 0 . 000 ) 

0.002 ( 0 . 000 ) 

( 50 , 50 ) 

0.5 

0.003 ( 0 . 000 ) 

0.005 ( 0 . 000 ) 

0.004 ( 0 . 000 ) 

0.018 ( 0 . 000 ) 

0.027 ( 0 . 000 ) 

0.034 ( 0 . 000 ) 


Table 13: Simulated Type I error rates of the bootstrap doo-test (4.8) for the equivalenee of 
two shifted EMAX-models defined by (5.2). The threshold in (4.6) is ehosen as Soo = 0.5. 
The numbers in braekets show the simulated Type I error rates of the test proposed by 
Gsteiger et al. (2011). 




a - 0.05 

q : = 0.1 



(A, A) 

(A, A) 

(" 1 , " 2 ) 

d = doo 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 0 . 25 , 0 . 25 ) 

( 0 . 5 , 0 . 5 ) 

( 0 . 25 , 0 . 5 ) 

( 10 , 10 ) 

0.25 

0.062 ( 0 . 000 ) 

0.050 ( 0 . 000 ) 

0.053 ( 0 . 000 ) 

0.147 ( 0 . 000 ) 

0.118 ( 0 . 000 ) 

0.118 ( 0 . 000 ) 

( 10 , 10 ) 

0.1 

0.100 ( 0 . 000 ) 

0.070 ( 0 . 000 ) 

0.099 ( 0 . 000 ) 

0.195 ( 0 . 000 ) 

0.137 ( 0 . 000 ) 

0.190 ( 0 . 000 ) 

( 10 , 10 ) 

0 

0.109 ( 0 . 000 ) 

0.090 ( 0 . 000 ) 

0.092 ( 0 . 000 ) 

0.216 ( 0 . 000 ) 

0.143 ( 0 . 000 ) 

0.176 ( 0 . 000 ) 

( 10 , 20 ) 

0.25 

0.077 ( 0 . 000 ) 

0.077 ( 0 . 000 ) 

0.074 ( 0 . 000 ) 

0.157 ( 0 . 000 ) 

0.142 ( 0 . 000 ) 

0.141 ( 0 . 000 ) 

( 10 , 20 ) 

0.1 

0.118 ( 0 . 001 ) 

0.077 ( 0 . 001 ) 

0.100 ( 0 . 000 ) 

0.227 ( 0 . 002 ) 

0.163 ( 0 . 002 ) 

0.176 ( 0 . 000 ) 

( 10 , 20 ) 

0 

0.151 ( 0 . 001 ) 

0.078 ( 0 . 001 ) 

0.118 ( 0 . 000 ) 

0.275 ( 0 . 004 ) 

0.165 ( 0 . 003 ) 

0.213 ( 0 . 000 ) 

( 20 , 20 ) 

0.25 

0.085 ( 0 . 000 ) 

0.060 ( 0 . 000 ) 

0.076 ( 0 . 000 ) 

0.171 ( 0 . 005 ) 

0.134 ( 0 . 001 ) 

0.162 ( 0 . 000 ) 

( 20 , 20 ) 

0.1 

0.158 ( 0 . 000 ) 

0.090 ( 0 . 000 ) 

0.112 ( 0 . 000 ) 

0.309 ( 0 . 007 ) 

0.184 ( 0 . 002 ) 

0.220 ( 0 . 001 ) 

( 20 , 20 ) 

0 

0.178 ( 0 . 003 ) 

0.108 ( 0 . 001 ) 

0.120 ( 0 . 003 ) 

0.324 ( 0 . 013 ) 

0.209 ( 0 . 001 ) 

0.219 ( 0 . 008 ) 

( 50 , 50 ) 

0.25 

0.162 ( 0 . 023 ) 

0.086 ( 0 . 000 ) 

0.098 ( 0 . 006 ) 

0.283 ( 0 . 084 ) 

0.178 ( 0 . 007 ) 

0.218 ( 0 . 034 ) 

( 50 , 50 ) 

0.1 

0.390 ( 0 . 117 ) 

0.212 ( 0 . 002 ) 

0.232 ( 0 . 017 ) 

0.568 ( 0 . 325 ) 

0.349 ( 0 . 018 ) 

0.398 ( 0 . 101 ) 

( 50 , 50 ) 

0 

0.457 ( 0 . 157 ) 

0.211 ( 0 . 012 ) 

0.266 ( 0 . 032 ) 

0.630 ( 0 . 364 ) 

0.363 ( 0 . 033 ) 

0.438 ( 0 . 172 ) 


Table 14: Simulated power of the bootstrap doo-test (4.8) for the equivalenee of two shifted 
EMAX-models defined by (5.2). The threshold in (4.6) is ehosen as Soo = 0.5. The numbers 
in braekets show the simulated power of the test proposed in Gsteiger et al. (2011). 

6 Case study 

In this section we illustrate the new methodology with the dose hnding study described in 
Biesheuvel and Hothorn (2002). Female and male patients with Irritable Bowel Syndrome 
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(IBS) were randomized to one of the five doses 0 (placebo), 1,2,3, and 4. We use the blinded 
dose levels for conhdentiality. The primary endpoint was a baseline adjusted abdominal 
pain score with larger values corresponding to a better treatment effect. In total, 369 
patients completed the study, with nearly balanced allocation across the five doses. The 
data is available in the R package DoseFinding from Bornkamp et ah (2015). 

For this example, we used the linear model mi{x,(3i) = (3i^i + (3i^2X for the males and the 
Emax model m2{x,(32) = 1^2,1 + (^ 2 , 2 3 ^ 3 . for the females. Note that males and females 
were assigned to the same set of doses. The estimators in the linear and in the Emax model 
are given by /3i = (0.398,0.043) and P 2 = (0.220,0.517,1.396), respectively. The left part 
of Figure 2 displays the htted dose response models for both groups in the interval [0,4]. 
As it can also be observed from Figure 2 , the maximum distance between the two curves 
is /5i,i — (^ 2,1 = 0.1784, attained at a; = 0. 


£00 

a - 0.05 

a = 0.1 

0.3 

0.1293 

0.1628 

0.35 

0.1578 

0.1972 

0.4 

0.1867 

0.2322 


Table 15: Quantiles of the bootstrap test (4.8) (from 5000 replications) in the IBS case 
study for different values of the threshold Soo in the hypothesis (4.6). 

We hrst compare males and females with respect to the maximal deviation distance doo 
dehned in (2.3). For this purpose we apply the bootstrap test (4.8) proposed in Section 4, 
which is implemented with the R package TestingSimilarity from Moellenhoff (2015). 
In Table 15 we display the quantiles of the bootstrap test (4.8) for different values of 
the threshold £00 in the hypothesis (4.6). These values are calculated by 5000 bootstrap 
replications. For example, if e^c = 0.35 we obtain the quantiles go. 1,00 = 0.1972 for a = 
0.1 and go.o 5 ,oo = 0.1578 for a = 0.05, while the value of the test statistic is given by 
doo{/3i, ( 32 ) = 0.1784. Thus, we can reject the null hypothesis (4.6) at the signihcance level 
a = 0.1, but not at o; = 0.05. In the right part of Figure 2 we display the p-value 

i=l 

of the bootstrap test (4.8) as a function of the threshold £00 • We observe that the p-value 
corresponding to the choice £00 = 0.35 is given by 0.078. The plateau of the curve is a 
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Figure 2: Left panel: Fitted dose response curves for male (linear model) and female (Emax 
model) patients from the IBS case study. Right panel: p-values of the test (4.8) for different 
values of the threshold e^o in the hypothesis (4.6). 

consequence of the special construction of the test. If £oo < <^oo = 0.1784, the constrained 
estimators in (3.13) coincide with the unconstrained estimators and consequently the boot¬ 
strap data is generated with the same parameters for each of these different values of Eoc- 
In comparison, using the test of Gsteiger et ah (2011) discussed in Section 5, the maximum 
(minimum) of the upper (lower) conhdence band is given by 0.282 (—0.450) and 0.227 
(—0.390) for a = 0.05 and a = 0.1 respectively. Thus, the null hypothesis cannot be 
rejected at either of these two signihcance levels as the bands are not completely contained 
in the rectangle X x [—£ 005 ^ 00 ]- Therefore, similarity of the curves at a signihcance level 
of a = 0.1 can only be claimed if £00 is larger than 0.390. This illustrates again that the 
test proposed in Gsteiger et ah (2011) is conservative. 

It might be also of interest to compare both curves with respect to the squared distance 
(2.4), which is given by d 20 i,f 2 ) = 0.0126 for the IBS data. The 0.05 and 0.1 quantile 
of the bootstrap distribution are given by go. 05,2 = 0.0108 and qo.1,2 = 0.0169, respectively, 
for a choice of £2 = 0.05. Thus, we can reject the null hypothesis (3.1) at signihcance 
level a = 0.1 but not at a = 0.05. In this case the p-value is given by 0.059. Finally, we 
illustrate the application of the asymptotic test (3.12). The variance estimator dehned in 
(3.10) is given by = 0.0010 and for 62 = 0.05 we obtain the critical values —0.0022 
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(a = 0.05) and 0.0093 (a = 0.1) for the test (3.12). Therefore the null hypothesis cannot 
be rejected at either of these two signihcance levels and the corresponding p-value is given 
by 0.1193. These hndings coincide with the results of the simulation study, which shows 
that the bootstrap test (3.16) is more accurate. 
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7 Appendix: Technical details 

The theoretical results of this paper are proved under the following assumptions. 

Assumption 7.1. The errors are independent, have finite varianee a‘1 and expectation 
zero. 

Assumption 7.2. The covariate region X C is compact and the number and location 
of dose levels ki does not depend on n^, £ = 1 , 2 . 

Assumption 7.3. All estimators of the parameters (Ii, (I-z are computed over compact sets 
Bi C and B 2 C . 

Assumption 7.4. The regression functions mi and m 2 are twice continuously differen¬ 
tiable with respect to the parameters for all bi, 62 in neighbourhoods of the true parameters 
I3i,l32 and all x E X. The functions {x,he) h-)■ mt{x,b^) and their first two derivatives are 
continuous on X x B^. 
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Assumption 7.5. Defining := ~ we assume that 

for any u > 0 there exists a constant Vu,e > 0 such that 


lim inf inf inf {b) > Vu/ 

ni,n2—>-c>o a£Bi \b—a\>u ’ 


£= 1 , 2 . 


In particular, under Assumptions 7.1 - 7.5 the least squares estimator can be linearized. 
To be precise, consider arbitrary sequences (/3f,n)neN and (o-£,n)neN and M"*" such that 
/3e,n —t fie. and ae^n —)■ > 0 as ni, n 2 —)■ oo (£ = 1, 2) and denote by data of the form 
given in (2.1) with fi^ replaced by fi^^n and rje^ij independent and identically distributed (for 
each hxed n) with mean 0 and finite variances Then the least squares estimators fi^""'' 
computed from satisfy 


k£ '^£,1 


fie,i,j + op(l), 

V i=l j=l 


= 1 , 2 , 


(7.1) 


where the functions are given by 




he,, 


= 1 . 2 , 


(7.2) 


and Yie,n takes the form 

ke 


1 




= 1,2. 


(7.3) 


e,n j=i 


Proof of Theorem 3.1: Let £ 00 (^ 1 ^) denote the space of all bounded real valued functions 
of the form : A —)■ M. The mapping $ : dehned by 

{ P^ l_^ 

rj, rj, (’^• 4 ) 

is continuous due to Assumptions 7.2-7.4, where we use the Euclidean and the supremum 
norm on and £ 00 (A), respectively. Consequently, the continuous mapping theorem 

[see Van der Vaart (1998)] and (3.7) yield that the process 

{y/nGnix)}^(.x ■= {Vn{i^^mfix,bfi\^^^^^f{fii-fii) - i^m2ix,b2)\^^^^^fifi2 - fi2))}^^;^ 
converges weakly to a centered Gaussian process {G{x)}x&x in £ 00 (V), which is dehned by 

G)*) = (»r™i(^.'>OL,=,/VASy-'^Z. - (7.5) 
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where and Z 2 are independent pi- and p 2 -diniensional standard normal distributed 
random variables, respectively, i.e. Zi ~ A/'(0, JpJ, £ = 1 , 2 . A straightforward calculation 
shows that the covariance kernel of the process {G{x)}x^x is given by (3.11). Now a Taylor 
expansion gives 

Pn{x) := {m,{xJ,)-m,{x,/3,)) - {m^ixj^) - m^ix,/S^)) = G^ix) + (7.6) 

uniformly with respect to a; G A", and it therefore follows that 

{y/npn{x)}x(.x ^ . (7.7) 


Recalling the dehnition of A(a;,/di,/ 32 ) in (2.5), observing the representation 




A‘^{x, ^i, P2)dx 


A‘^{x, f3i, /32)dx 



J X ^ 

A{x,/3i,/32)pn{x)dx, 


^ 'jy 

and from the continuous mapping theorem we obtain y/n{d 2 —d 2 ) —?■ 2 A(x, /3i, /32)G(x)dx, 
where G denotes the Gaussian process dehned in (7.5). Now it is easy to see that the dis¬ 
tribution on the right-hand side is a centered normal distribution with variance dehned 
in (3.10). This completes the proof of Theorem 3.1. □ 


Proof of Theorem 3.4: 

Proof of (1). First we will determine the asymptotic distribution of the bootstrap estimators 
jdl and Then we use similar arguments as given in the proof of Theorem 3.1 to derive 
the asymptotic distribution of the statistic d^ (appropriately standardized). Finally, in a 
third step, we establish the statement (3.17). 

Recall the dehnition of the estimators in (3.13) and note that it follows from Assumptions 
7.1-7.5 that d 2 > £2 under the null hypothesis . We distinguish two cases. If d2 > 62 , 
consistency of implies that ^2 > £2 with probability tending to one, and thus /di = 
with probability tending to one. Next consider the case ^2 = ^ 2 - Let M := {( 61 , 62 ) G 
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Bi X B 2 : d 2 {bi,b 2 ) = £ 2 } and note that {/3i, (32) ^ M. Define 

ki ki 

'■= '^—{me{xe,i,l3e)-me{xe,i,b))^, 7 ^ := — Y] 

1^1 7^1 

^ ki 

i^eib) := —Y] y^(m£(a;£,j, /Se) + Ve,ij “ mixe,i, b))"^. 

Tip 

^ i=l 3=1 


By definition (see (3.13) and (3.14)), we have = argniin(fej^ 62 )gM More¬ 


over, 


and 


arg min Y 7^i{b() = axg min Y- It)^ 

{bi,b2)&M ' ^ ''f'- M ' ^ 


i=l 


(bi,b2)£M 


e=i 


'^£A 


^e{be) -7e = w\^) + 2 y^(m£(a;£,i, Z^^) - mi{xe^i,b))—Y,Ve,i,j- 

i=l ^ j=l 

Observing that the terms \m(,{x^, 3() —^(.{.xi^i,b)\ are uniformly bounded (with respect to 
b & B^ and x^^i G df) it follows that 


Rn ■= V'sup \ 2Yi^ei^i,i, (de) - mi{xi^i,b))—Y^Vi,i,j 

7^1 ^ ^ 


Op(l) 


since maxj=i ...^fc 2 \^f.Yj=\V(.,i,j\ = op(l). By similar arguments as given after (7.15), we 
obtain 1/3^ — f3(\ = op(l), £ = 1,2 (recall that (/3i,/32) G M). Since for 7 = 1,2 we have 
\l3i — (3(\ < \l3e. — I3(\ + \l3i — I3(\ and it follows from consistency of (3^ that 


^ P 

(3i —)■ /3e ^ = 1, 2, whenever 3,2 > £ 2 - (7.8) 

For 7 = 1, 2 let = a(Yi^ij\i = 1,..., /c^, j = 1,... ,n(^i) denote the a-field generated by 
the random variables = 1,..., ke, j = 1,..., n£^i} and 3^ ;= a(3^i, 3 ^ 2 ) (note that we 

do not display the dependence of these quantities on the sample size). Given (7.8) and the 
consistency of a^, the discussion after Assumption 7.5 yields 




^meixe,i,, 


-I 


S-e 


+ Op(l) , 


£= 1 , 2 . 


where the pi x pi and p 2 x P 2 dimensional matrices ^ and E 2 ^ 


are defined by 


^ 2=1 
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Since by construction the are i.i.d. with unit variance and independent of 3^, the 
classical central limit theorem implies that, conditionally on y in probability 

Aa4(0,4J , £=1,2, (7.9) 


where the matrix is dehned in (3.4). Observing the dehnition of the statistic 
^2 = d20l.,^l) = f {mi{x,Pl) - m2{x,^l)Ydx, 


J X 

it now follows by the same arguments as given in the proof of Theorem 3.1 that 

^(J;-d2 ) ^ A7(0,l) (7.10) 

<7(^2 

conditionally on 3^ in probability. Now recall that qa ^2 is the a-quantile of the bootstrap 
statistic ^2 conditionally on y and note that, almost surely. 


a = 


<qa,2\ 3^) = P 


( Vn{d2 


do 


V 


< 




a, 2 


do 


(^d2 




y 


(7.11) 


Letting := \/n{qa ,2 — d 2 )/(Jd 2 if follows from (7.10), (7.11) and Lemma 21.2 in Van der 
Vaart (1998) that 

(7.12) 


where Ua denotes the a-quantile of the standard normal distribution. This relation implies 
for any a < 0.5 that 

lim P(gQ ,,2 — d 2 > 0 ) = lim P {pa > 0) = 0. (7-13) 

n^oo ’ n^oo 

After these preparations we are able to prove the first part of Theorem 3.4, i.e. we show 
that the bootstrap test has asymptotic level a as specihed in (3.16). It follows from 
(3.13) that in the case d 2 = d 2 {( 3 i,/ 32 ) > £2 the constrained estimators /5i and /52 coincide 
with the unconstrained OLS-estimators /3i and P 2 , respectively. This yields in particular 
^2 = d 2 {Pi, P 2 ) = d 2 whenever 62 > £ 2 - 
If d 2 > £2 we have 


2 < 90 , 2 ) — 

< 

< 


2 < qa,2-, d2 ^ £ 2 ) + P(d2 < qa,2, ^2 < S 2 ) 
2 < qa,2, d2 = d2) + P(d2 < £ 2 ) 


^ ^ , Tn,i Vn{d2-d2) ^ ^{e2-d2) 

2 < qa,2} + Jr I - < 


<^d2 


(^d2 
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Observing that 82 — d 2 < 0, it now follows from Theorem 3.1 that the second term is of 
order o(l). On the other hand, we have from (7.13) that the hrst term is of the same order, 
which gives lim„j^„ 2 _^oo lP(d 2 < = 0 and proves the hrst part of Theorem 3.4 in the 

case d 2 > 82 - 

For a proof of the corresponding statement in the case ^2 = £2 we note that it follows again 
from (7.13) 


P(d 2 < qa, 2 ) 


P(d2 < ga,2, d2 > 82) + P(d2 < g«,2, 

¥02 < qa,2, d2 = d2) + ¥02 < qa,2, 

¥02 < qa,2, d2 = 0) + ¥02 < qa,2: 

^/ y/n02 - d2) ^ Vn{qa^2 - ^ 2 ) J 
^ <7^2 '^d 2 

^r Vn02 - d2) ^ Vn(ga,2 - ^2) \ 

^ ^d 2 ^d 2 ^ 


d 2 < £ 2 ) 

d2 = £2) ~ P(d2 < ?a,2, <^2 
0 — £2) + ®( 1 ) 

= £2) + 0(1) 


- ¥02 - d2 < qa,2 - ^2, d2 > £2) + 0(1), 


^ 2 ) 


(7.14) 


where the third equality is a consequence of the fact that y/n 02 — ^ 2 ) is asymptotically 
normal distributed, which gives ¥02 < qa, 2 ,d 2 = £ 2 ) < ¥02 = £ 2 ) = o(l). If ^2 > £2 h 
follows that d 2 = d 2 > £2 = ^2 and consequently the second term in (7.14) can be bounded 
by (observing again (7.13)) ¥02 — d 2 < qa ,2 — ^202 > £ 2 ) < ¥{qa ,2 — ^2 > 0) = o(l). 
Therefore we obtain from Theorem 3.1, (7.12) and (7.14) that hm„^oo P(d 2 < ^ 0 , 2 ) = 
‘h(Ma) = a, which completes the proof of part (1) of Theorem 3.4. 

Proof of (2). Finally, we consider the case ^2 < £2 and show the consistency of the 
test (3.16). Theorem 3.1 implies that ^ 2 — >d 2 . Since ^2 < £ 2 -, there exists a constant 
(5 > 0 such that ¥02 < £2 — <5) —t 1. Hence the assertion will follow if we establish that 
¥{qa ,2 > £2 — <^) —^ 1- To show this, denote by Fb0b2 si S 2 conditional distribution 
function of d^ given 0 = bg, an = sn {i — 1) 2). Since P(max£=i ^2 < r) —)■ 1 for any 

r > 0, it suffices to establish that for some r > 0 


sup {Tiil2,si,s2(^2 - 5) I e 


1 , 2 ; max \s(, — < r} —)■ 0 . 


By uniform continuity of the map ( 61 , 62 ) t ^ 2 ( 61 , 62 ) it suffices to prove that for all 77 > 0 
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and £ = 1,2 


sup 


-be\>n\h^ h^cTk ^ Sky 1,2) (bkySe) : |sf - (7f| < r,6f G 1 ^ 1,2| -l 0. 

(7.15) 


We will only prove the above statement for £ = 1 since the case £ = 2 follows by exactly 
the same arguments. For i = I,ki,j = 1, let Cjj i.i.d. ~ A/’(0,1) and dehne 


iilw ■= T. — {mi{xi^i, a) - mi{xi^i, b)Y , % := — 


fci 


ni, 


ki ni^i 


se. 


i=l 


Til 
ki 


ni 


lyj! 


i=l j = l 


A,s(b) := a) + seij -mi(xi^i,b)f. 


Til 


i=i j=i 


By construction, the conditional distribution of PI given /5i = a, di = s is equal to 
the distribution of the random variable ba,s '■= dxgmmb^Bi'Pa,s{b)- On the other hand, 

argminfegB^ 'tpa,s{b) = argminfegBj(^/>a,s(&) -%), and 


ipaAb) -% + 2s^(mi(a;i,*,a) - 6))—^ 6*^. 


fci 


771,. 


i=l 


ni ^ , 
j=i 


Observing that the terms \mi{xi^i, a) — mpxi^i, b)\ are uniformly bounded (with respect to 
a,b E Bi and Xi^i E X) \t follows that 


Rn := sup sup 

|s—cri|<r a,b^Bi 


fci 


ni. 


2s^{mi{xi^i,a) - 6)) — ^ 


i=l 


ni 


j=i 


since maxj=i^.,.^fci IX]j=i jI = op(l). Now we obtain from Assumption 7.5 that, for 
sufficiently large n, 

sup P(|/5i - a| > r]\/3i = a, di = s) < sup P(|&a,s - a\ > rj) < P(i?„ > v^/A) = o(l). 

(a,s):|5—(Ti |<r (a,s):|5—cri |<r 


Thus (7.15) follows, which completes the proof of Theorem 3.4. 


□ 


Proof of Theorem 4.1: Recall the dehnition of the estimator doo in (4.1) and dehne the 
random variables 

Dn = Vn {doo - doc) = Vn {max\mi{x,Pi) - m2{x,P2)\ - doo), (7.16) 

Zn = \/n {max\mi{x,Pi)-m2{x,P2)\ - doo)- (7-17) 

x££ 
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We will use similar arguments as given in Raghavacliari (1973) and show that 

Rn = Dn-Zn = or{l), (7.18) 

Zn^Z, (7.19) 

which proves the assertion of Theorem 4.1. For a proof of (7.18) we recall the dehnition of 
the ’’true” difference A{x, (3i, (32) in (2.5) and the dehnition of the process {pn{,x)}x&x in 
(7.6). It follows from (7.7) and the continuous mapping theorem that 

lim P( max |p„(a;)| > a„) = 0 (7.20) 

ni,112^00 ' x£X ' 

as ni,n 2 —)■ cxd, n/rii —)■ A G (l,cx)), where = \ogn|^/n. By the representation p„(a;) = 
Gn{x) + op(n“^/^) uniformly in x G X and the dehnition of in (7.6) we have for every 
7 ] > 0 

lim lim P( sup ^/n\pn (x) -Pniy)\ > v) =0, (7.21) 

54.0 ni,n 2 ^>oo / 

where || ■ || denotes a norm on X G In the following discussion dehne the sets 

= {x e X \ \ t - A(a;,/3i,/32)| < On} (7.22) 

and Sn = S(( U S~, then it follows from the dehnition of and (7.20) that 

0 < Rn = A/nf max |A(a;, /5i,/52)| - max |A(a;,/3i,/32)|) 

V x&X xGS / 

= ^/n( max |A(a;,/5 i,/ 52)| - max |A(a;,/5 i,/ 52)| ) + op(l) 

V XGSn. x££ / 

< max(i?",R+)+ 0 p(l), 

where RX = ydr(max^g^P |A(a;,/5i,/52)| — maxa-g^-^ |A(a;,/3i,/52)|). We now prove the es¬ 
timate RX = op(l), which completes the proof of assertion (7.18). For this purpose we 
restrict ourselves to the random variable R(( (the assertion for R~ is obtained by similar 
arguments). Note that C S(( and therefore it follows that 

0 < = A/nf max A(a;,/5i,/32) - max A(a;,/5i,/32)) + op(l) (7.23) 

V x££+ xe£+ J 

< max y/npn{x) — max y/npn{x) -f ^/n\ max A(a;, (3i, ( 32 ) — doo [ + op(l) 

x&£+ x(i£+ I x&£+ J 

= max \/npn{x) — max \/npn{x) + op(l). 
x&£+ x&£+ 
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Now define for 7 > 0 the set = {x&X\3y^£^ with ||a; — y\\ < 7 } and 

the constant 5n = 2inf {7 > 0 | C £^+( 7 )}. Obviously C £~^{Sn) and the sequence 
((5n)neN is decreasing, such that S := lini„_^oo exists. By the dehnition of (5„ we have £^ ^ 
£^"'"(^n/4). Consequently, there exist G C df such that \\xn — y\\ > 5n/4 for all y & £~^ 
and all n E N. The sequence (a;n)„gN contains a convergent subsequence (because X is 
compact), say (x^J^eN which satishes lim^^oo Xn^ = x e df, doo = hm^^oo A{xnk,Pi, 1^2) = 
A(a;, /di,/d 2 ). Consequently, x G T"*", but by construction Wxn^ — a;|| > 5„/4 for all k E N, 
which is only possible if £ = lim„_,.oo = 0 . 

Now it follows from inequality (7.23) for the sequence ((5n)neN 

op(l) < < max \/npn{x) — max\£npn{x) + op(l) 

X&£+(5n) X&S + 

< max y/n\pn{x) - Pn{y)\ + (¥{l) = 0p{l), 

\\y-x\\<s„ 

where the last estimate is a consequence of (7.21). A similar statement for R~ completes 
the proof of (7.18). 

For a proof of the second assertion (7.19) we dehne the random variable 

Zn = max I ma,x^/npn{x); max(—vdrp„(a;))}, 

^ xG£+ xG£- 

~ T) 

then it follows from (7.7) and the continuous mapping theorem that Zn —> where the 
random variable Z is dehned in (4.4). Observing the uniform convergence in (7.20) we 
have as ni, n 2 —)■ C )0 

¥{Zn <t)= ¥[Zn < t, max \Pn{x)\ < + 0(1) = ¥[Zn < t, max \Pn{x)\ < ^) + 0(1) 

= ¥{Zn < t) + 0(1) = F{Z <t)+ 0(1). 

This proves the remaining statement and completes the proof of Theorem 4.1. □ 

Proof of Theorem 4.2: We begin by noting that the statement (4.11) follows by exactly 
the same arguments as given in the proof of (3.18) in Theorem 3.4 once we note that the 
mapping ( 61 , 62 ) t doo{bi, 62 ) is uniformly continuous. The details are omitted for the sake 
of brevity. 
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Throughout the remaining proof, let /3i and 132 denote the estimators dehned by 




/3e if doo > £c 

i3f if doo < £oo 


= 1 , 2 , 


where I3i, (32 denote the OLS-estimators of the parameters I3i, (32 under the constraint 
dooil3i:/32) = £oo and dehne 


Pnix) ■= - {m2{x,/32)-m2{xj2)), 


G*nix) = (J-mi(a;,6i) , s - A) - (J-m 2 (a;, 62 ) 


\T/ p* 


d 


\T/ p* 


^2=^2' 


yni hij 

Similarly to the proof of Theorem 3.1, it is possible to establish that 

3 p 


'•£ i = 1,2, whenever d^o > £oo, 
Pni^) = G;(a;) + op(n"^/2) 




(7.24) 

(7.25) 

(7.26) 


uniformly with respect to a; G df, where {G{x)}^^^ denotes the Gaussian process dehned 
in (7.5). Here, the weak convergence in (7.25) holds conditionally on 3^ in probability as 
well as unconditionally. 

From now on assume that the null hypothesis d^o > £oo is satished, and dehne 

7^* = {x e X \mi{x,j3l)-m2{x,j3l) = Tdlo], 

£(^ = {x e X \ mi{x,( 3 i) -m2{x,^2) = Tdoo}, 

where 

d*^ = sup\mi{x,^l)-m 2 {x,^ 2 )\ , doo = sup |mi(a;,/3i) - 7712 ( 0 ;,/32)| • 

From (7.24) and the continuous mapping theorem we obtain the existence of a sequence 
(7n)eN such that 7 n —t 0 and 

sup \A{x,f3i,(32 ) - A{x,Pi,/32 )\ = op( 7 „), sup \A{x, (31, - A(a;,/5i,/^a)! = op(an),(7.27) 

x^Pd xGA' 

where a„ = logn/^/n and the second statement follows from (7.25). Moreover, from the 
representation (7.25) we have for every 77 > 0 

lim lim sup Pf 07 sup |p* (x) — p* ( 7 /)| > 77 j = 0. (7.28) 

AO ni,n2—>-co ^ |P_j,||<i5 / 
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Now define bn = max{ 7 „, a„} and consider the sets 

J^n = {x ^ X \ I ± doo - A(a;,/3i,/32)| < &n} 
and J^n = U J^n ■ Additionally, define the set 

= {x E X \ 3 y ^ with ||a; — y\\ < 7 } 


for 7 > 0. At the end of the proof we shall show that there exists a sequence <5^ —^ 0 such 
that 

P(4± U Xt C J^n C S^iSn)) -E 1. (7.29) 

In the special case = 1 with £ = {xq} for some Xq E X we shall prove that additionally 

\/n{d*^ — doo) —t G{xq) conditionally on y in probability. (7.30) 


Given (7.29) and (7.30) we prove (4.9) and (4.10). For a proof of (4.9) note that qa,oo is the 
a-quantile of the bootstrap test statistics dy conditionally on 3^. Thus the a-quantile of 
the distribution of \/n{dy — d^o) conditionally on 3 ^ is of the form pa,cxi ■= ^/n{qa,oo — doo) 
and, by (7.30), satishes pa ,00 —t Za^ where Za denotes the a-quantile of the distribution of 
G(a;o). With := Var(G(a;o)), it now follows from Lemma 21.2 in Van der Vaart (1998) 
Pa/doc —^ where Ua denotes the a-quantile of the standard normal distribution. This 
result is the analogue of (7.12) in the proof of Theorem 3.4, and (4.9) now follows by exactly 
the same arguments as given in the proof of (3.17) in Theorem 3.4. 

Next, we derive a preliminary result that will be used to prove (4.10). Dehne for the 
sequence 5n from (7.29) 




■\/nmax| max Pn{x), 

^ X&£+(Sn) 


max (-p;(a;))} 

X&£-{5ri) 


From (7.29) we obtain 



Moreover, 


max |A(a;,/ 3 i,/ 32 )| - doo 

x£X 

max{ max A(a;,/3^,/d^), max {—A{x, PI, P 2 ))} — doo + (7.31) 

xe£+{Sn.) X&£-{Sn) 


max (±A{x,Pl,p;) 

x££^{Sn) 


< max 

x££^(Sr. 


(±A(a;, /3i, T A(a:, /5i, P 2 )) 


39 



and thus 


y^(C-rfoo) <2: + op(l). (7.32) 

Denoting by pa,oo the a-quantile of y/n{dl^ — doo) conditional on the data we have ga,oo = 
doo "1“ ^ ^ Pa^oo' 

Dehne F* as the distribution function of y/n{dl^ — doo) conditional on the data, as the 
distribution function of conditional on the data and Fz as the distribution function 
of Z. By the definition of Z and the results in Tsirel’son (1976) the function Fz can 
have at most one jump, and this jump can only be located at its left support point. 
Since Fz is continuous at qz,a and qz,a < 0 there exists to < 0 such that Fz(to) < a 
and Fz is continuous on [to,C)o). Since Z* converges weakly to Z conditionally on the 
data in probability it follows that sup^>^g l-^n(t) — Fz(t)\ = op(l). From (7.32) we have 
P(F*(t) > Fn(t — e) Vt) — )■ 1 for any e > 0, and the uniform continuity of Fz on [to, cxd) 
yields P(F*(t) > Fz{t) —e Vt > to) —)■ 1 for any e > 0. Let qz,a+s denote the a + 5 quantile 
of Fz- For arbitrary d > 0 it follows that qz,a +5 > to and thus 

o(l) = F{F*{qz,a+s) > Fz{qz,a+5) - d/3) = F{F*{qz,a+s) > a + 2d/3). 

Thus, by dehnition of pa,ooi we have 

IP(Pq,oo > qz,a+5) = o(l) Vd > 0. (7.33) 

Specihcally, choosing d > 0 with qz,a +5 < 0 we obtain 

IP(ga,oo > doo) = IP(Pa,oo > 0) = o(l). (7.34) 

Given (7.33) and (7.34) we are ready to prove (4.10). First consider the case doo = ^oo and 
note that it follows from (7.34) that 

lP(doo ^ 9o,oo) P(doo ^ ?a,oo) doo '^oo) T P(doo ^ qa,coi doo ^ '^oo) 

P(doo ^ qa,ooi doo '^oo) T P(doo ^ qa,ooi doo ^ '^oo) 

P^y^(doo doo) ^ V^^(^a,oo doo), doo ^(f) 

^ P^y^(doo doo) ^ v^^(dtt,oo doo)^ T ^(1) 

P^y^(doo doo) ^ PajOo'j T ®(1)) 
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where the second equality follows since on the event doo > £oo we have doo = d^o and the 
third equality follows from (7.34). From (7.33) we obtain for any 5 > 0 

F(yn{doo - doc) < Pa,oc^ < P(^Vn(doc - doc) < gz,a+5^ + o(l) Ct + (5 

because the distribution of £ is continuous at qz,a+s- Since <5 > 0 was arbitrary (4.10) 
follows in the case doc = £oc- 

Next consider the case doc > £oc- We have 

lP(doo ^ ?o,oo) lP(doo ga,oci doc ^ ^oo) IP(doo ^ ^a,oo) doc ^ '^oo) 

^ ]P(doo ^Q:,oo7 doc doc) 4“ lP(doo '^oo) 

< P(cioo < qa,oc) + P(yn(doo - doc) < Vn{eoc - doc)) = o(l) 

where the hrst term in the last line is of order o(l) by (7.34) and the second term vanishes 

since ^/n(^oo — doc) —t oo while \y/n{doc — doo)| converges weakly and thus is of order Op(l). 
This completes the proof of (4.10). 

It remains to establish (7.29) and (7.30). We begin with a proof of (7.29). Without loss of 
generality, we only prove the existence of — )■ 0 with P(T+ U C C S~^{6n)) — )■ 1. 
We may assume that ^ 0, otherwise S~^{6n) is empty and it is straightforward to show 
that will be empty with probability converging to one. Dehne 6n = 2- inf {7 > 

0 I C £^{j)}. Obviously C £^{5n) provided that 5n > 0. Moreover, without loss 
of generality we assume that the sequence bn is non-increasing. As a consequence ((5n)neN 
is also non-increasing, such that 5 := lim„_^oo exists. By the dehnition of 5n we have 
J^n 't- £^{.dnl^) unless = 0 in which case <5 = 0. Consequently, for each n with (5„ > 0 
there exists an Xn € J^n ll^n ~ ^oll > <^n/4 for all Xq G £^. As X is compact, 

there exists a convergent sub-sequence, say {xnf,)k£n with limit hmfc^.oo = x E X and 

doo = lim A{xnk,/di,/32) = A{x,/3i,/32). 

ni , 712^00 

Consequently, x G £~^, and from \\xn^. — a;o|| > d/4 for all k E N,Xo E £~^ we obtain 
S lim^_,.oo dn 0. 

Next, note that by (7.27) 

sup |mi(a;,/3i) - 7712 ( 2 ;, Z?^) - {mi{x,/3i) -m2{x,/32))\ = op(&n) 
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Hence 


c J^n) < ipf sup \mi{x,^l) - m2{x,^l) - {mi{x,l3i) - m2{x,l32))\ < bn/s) -)■ 1. 

The convergence P(T^ C J^+) —)■ 1 follows by similar arguments, which establishes (7.29). 

Finally, it remains to prove (7.30) for the special case that E- = {xq}. Assume without 
loss of generality that E = and E~ is empty. This implies that , E~ will be empty 
with probability converging to one and will contain at least one point with probability 
tending to one. Together with (7.31) we obtain 

■= y/n{dl^ - doo) = ^/n{ max A(a;,/d^) - doo) + op(l), 

X&e+{&ri) 

and thus 

op(l) < D* - yn(max A(a;,/3i,/32) - doo) 

= ^/n( max A(a;,/S^) — maxA(a;,/d^)) + op(l) 

Va:e£’+(<5„) x&S+ 7 

< max ^/npl^{x) — ^/np*J^{x) + y/n\ max A(a;,/3i,/32) — doo ? + op(l) 

a;e£’+(5„) x&S+ f xe£+(<5„) ) 

= max A/np* (x) - max \Enp*^{x) + op(l) 

xe€+(5„) xes+ 

= op(l) 

where the last equality follows from (7.28) and the second-to last equality is a consequence 
of (7.29). Thus 

v/n(d^-doo) = Vnf max A(a;,/ 5 i,/ 52 )-doo)+ 0 p(l) = ^nmaxp* (a;)+op(l) = ^/np^(xo)+op(l), 

where the last equality follows from a combination of (7.28) and (7.29). Since y/npl^{xo) 
converges weakly to G(a;o) conditionally on the data in probability the statement (7.30) 
follows. This completes the proof of Theorem 4.2. □ 
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